An improved general amino acid replacement matrix.

Authors

  • Si Quang Le
  • Olivier Gascuel
Abstract

Amino acid replacement matrices are an essential basis of protein phylogenetics. They are used to compute substitution probabilities along phylogeny branches and thus the likelihood of the data. They are also essential in protein alignment. A number of replacement matrices and methods to estimate these matrices from protein alignments have been proposed since the seminal work of Dayhoff et al. (1972). An important advance was achieved by Whelan and Goldman (2001) and their WAG matrix, thanks to an efficient maximum likelihood estimation approach that accounts for the phylogenies of sequences within each training alignment. We further refine this method by incorporating the variability of evolutionary rates across sites in the matrix estimation and by using a much larger and more diverse database than BRKALN, which was used to estimate WAG. To estimate our new matrix (called LG after the authors), we use an adaptation of the XRATE software and 3,912 alignments from Pfam, comprising approximately 50,000 sequences and approximately 6.5 million residues overall. To evaluate the performance of LG, we use an independent sample consisting of 59 alignments from TreeBase and randomly divide the Pfam alignments into 3,412 training and 500 test alignments. The comparison with WAG and JTT shows a clear likelihood improvement. With TreeBase, we find that 1) the average Akaike information criterion gain per site is 0.25 and 0.42 when compared with WAG and JTT, respectively; 2) LG is significantly better than WAG for 38 alignments (among 59) and significantly worse for only 2 alignments; and 3) tree topologies inferred with LG, WAG, and JTT frequently differ, indicating that using LG impacts not only the likelihood value but also the output tree. Results with the test alignments from Pfam are analogous. LG and a PHYML implementation can be downloaded from http://atgc.lirmm.fr/LG.
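
The abstract mentions two computations: turning a replacement matrix into substitution probabilities along a branch, and comparing models by Akaike information criterion (AIC) gain per site. The Python sketch below illustrates both under stated assumptions. It uses a toy 4-state alphabet with made-up exchangeabilities and frequencies (not the LG values), the standard reversible parameterization Q_ij = s_ij * pi_j, and it assumes the gain is the AIC difference divided by the number of alignment sites. All function and variable names are hypothetical; this is not the PHYML or XRATE implementation.

```python
# Illustrative sketch only: toy values, hypothetical names.

def build_rate_matrix(exch, freqs):
    """Reversible rate matrix Q with Q[i][j] = exch[i][j] * freqs[j] (i != j),
    rescaled so the expected rate at equilibrium is 1 substitution per unit time."""
    n = len(freqs)
    q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                q[i][j] = exch[i][j] * freqs[j]
        q[i][i] = -sum(q[i])  # diagonal is still 0.0 here, so rows sum to zero
    rate = -sum(freqs[i] * q[i][i] for i in range(n))
    return [[v / rate for v in row] for row in q]


def mat_mul(a, b):
    """Plain dense matrix product of two square matrices."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def transition_probabilities(q, t, squarings=10, terms=12):
    """P(t) = exp(Q t), via a truncated Taylor series with scaling and squaring."""
    n = len(q)
    scale = t / (2 ** squarings)
    a = [[v * scale for v in row] for row in q]
    identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    result = [row[:] for row in identity]
    term = [row[:] for row in identity]
    for k in range(1, terms + 1):
        # term becomes a^k / k!
        term = [[v / k for v in row] for row in mat_mul(term, a)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    for _ in range(squarings):
        result = mat_mul(result, result)  # undo the scaling by repeated squaring
    return result


def aic(log_likelihood, n_free_params):
    """Akaike information criterion: 2k - 2 ln L."""
    return 2 * n_free_params - 2 * log_likelihood


def aic_gain_per_site(lnl_new, lnl_ref, n_sites, k_new=0, k_ref=0):
    """Assumed definition: (AIC of reference model - AIC of new model) / sites."""
    return (aic(lnl_ref, k_ref) - aic(lnl_new, k_new)) / n_sites


# Toy symmetric exchangeabilities and equilibrium frequencies (assumptions).
TOY_EXCH = [
    [0.0, 1.0, 2.0, 0.5],
    [1.0, 0.0, 0.8, 1.5],
    [2.0, 0.8, 0.0, 1.2],
    [0.5, 1.5, 1.2, 0.0],
]
TOY_FREQS = [0.1, 0.2, 0.3, 0.4]

if __name__ == "__main__":
    q_toy = build_rate_matrix(TOY_EXCH, TOY_FREQS)
    for row in transition_probabilities(q_toy, 0.5):
        print(["%.4f" % v for v in row])
```

With a real 20-state matrix, the same construction applies, although production code would normally rely on an eigendecomposition or a library routine rather than a hand-written series.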

Similar resources

Amino Acid Substitution Matrices

• Biophysical properties of residues: Amino acids differ in size and charge. Some are acidic, some are basic, some have aromatic side chains. Generally, replacement of an amino acid with another amino acid with similar properties is less likely to break the protein or cause dramatic changes in function than replacement with an amino acid with different properties. A substitution matrix should r...

Amino Acid Substitution Matrices Tuesday

• Biophysical properties of residues: Amino acids differ in size and charge. Some are acidic, some are basic, some have aromatic side chains. Generally, replacement of an amino acid with another amino acid with similar properties is less likely to break the protein or cause a dramatic change in function than replacement with an amino acid with different properties. A substitution matrix should ...

Effect of solvent extracted soybean meal and full-fat soya on the protein and amino acid digestibility and body amino acid composition in rainbow trout (Oncorhynchus mykiss)

This study was carried out to investigate the apparent digestibility coefficients (ADCs) of protein, amino acids, and energy, and the body amino acid composition, of rainbow trout fed solvent extracted soybean meal (SBM) and full-fat soybean meal (FFS) partly replacing fish meal (FM) in diets. Five isonitrogenous (average 50.36% crude protein) and isoenergetic (4294 kcal/kg total energy) diets wer...

Stochastic Analysis of Amino Acid Substitution in Protein Synthesis

We present a formal analysis of amino acid replacement during mRNA translation. Building on an abstract stochastic model of arrival of tRNAs and their processing at the ribosome, we compute probabilities of the insertion of amino acids into the nascent polypeptide chain. To this end, we integrate the probabilistic model checker Prism in the Matlab environment. We construct the substitution matr...

A combined empirical and mechanistic codon model.

The evolutionary selection forces acting on a protein are commonly inferred using evolutionary codon models by contrasting the rate of synonymous to nonsynonymous substitutions. Most widely used models are based on theoretical assumptions and ignore the empirical observation that distinct amino acids differ in their replacement rates. In this paper, we develop a general method that allows assim...

Journal:
  • Molecular biology and evolution

Volume 25, Issue 7

Pages: -

Publication date: 2008